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ABSTRACT 


This paper presents a methodology for estimating initial mean and 
covariance parameters in a Kalman filter model from an ensemble of non- 
identical tests. In addition, the problem of estimating time constants 
and process noise levels is addressed. The work is motivated by practi- 
cal problems such as developing and validating inertial instrument error 
models from laboratory test data or developing error models of individual 
phases of a test. 
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1.0 INTRODUCTION 


This paper presents a methodology for estimating initial mean and 
covariance parameters in a Kalman filter model from an ensemble of non- 
identical tests. In addition, the problem of estimating time constants 
and process noise levels is addressed. The work is motivated by practi- 
cal problems such as developing and validating inertial instrument error 
models from laboratory test data or developing error models of individual 
phases of a test. 

Previous results in the literature [2,3] employ a Kalman smoother to 
obtain a sufficient statistic for the estimation of initial mean and 
covariance. Then the Expectation-Maximization (EM) algorithm [1] is 
applied to iteratively obtain maximum likelihood estimates of the entire 
initial mean vector and covariance matrix. The previous results are 
extended in this paper to account for parameter constraints such as con- 
straining variables that are physically unrelated to each other to be 
uncorrelated. Further, the results are extended to consider time con- 
stant and process noise level parameters. Previous techniques capable of 
estimating initial mean and covariance parameters and dynamic parameters 
require re-reunning a Kalman filter for each value of the parameter vec- 
tor considered. The new approach presented here is more efficient in 
that the filter need be re-run only for dynamic parameters. 

System testing (for example, of Inertial instruments in the labora- 
tory and error mechanisms from flight data) is often done in multiple 
phases that are physically different but linked dynamically in a given 
test. In order to obtain models for different phases, the previous 
results could be applied where the phase dynamics are stacked one on top 
of the other. New results are presented that provide a simpler and com- 
putationally improved approach that deals with each phase individually. 
The new results are also useful when only one multiple phase test is con- 
ducted, it is only desired to estimate the state in each phase, and the 
state is unobservable in a given phase but observable over all phases. 
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In practical testing situations, suboptimal filters are often used. 
Results are presented that account for filter suboptimality. 

Theoretical convergence results for the present application of the 
iterative EM algorithm are presented. Both the case of observable and 
unobservable per test dynamics are addressed. Also included are some 
references regarding rate of convergence and the effect of constraints on 
elements of the estimated covariance matrix. 
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2.0 THE DL/EM APPROACH 


The Data Likelihood (DL) algorithm was derived in [2, 3] during 
analyses directed toward the estimation of mean and covariance parameters 
of the initial distribution of certain discrete-time multi-dimensional 
Gaussian random processes. The essential idea is to use Kalman smoother 
(Bayesian) estimates of realizations of the initial conditions to esti- 
mate the mean and variance of these initial condition distributions and 
to test certain statistical hypotheses thereabout. The iterative DL 
scheme arose in the process of attempting to divorce the a priori model 
to be validated from the estimated model, and was observed to be of the 
form of the Generalized Expectation-Maximization method described by 
Dempster, et al [1]. 


The context of the problem is as follows. For each test, j , the 
realization of the r.v. x is assumed to be described by 


X 


k,j 


*k,jVl,j "^k,j 


, k“l,2,***,nj , 

j=l,***,N . 


( 1 ) 


and to be observed by 


z, . = M, .X, . + V, . 

k,j k,j k,j k,j 


( 2 ) 


Here w and v are assumed zero mean, white, uncorrelated, Gaussian, 

and independent of x^ . for all j . It is assumed that x^ . are 

, o,j 

realizations from a Gaussian distribution with mean w and covariance 
I , w and I unknown. In [2, 3] it is shown that the log likelihood 
function for the process can be written as 

log L(u,I) = log ^ » 


where 
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(3) 


log “ 


Y log |I+P(j)| 


J ^ 

A 

and R is independent of v , I . Here x. . are maximum likelihood 
(non-Bayesian batch least squares) estimates of , and P(j) are 

the associated estimation error covariances. Differentiating Log 
with respect to u and I and setting the derivative to zero yields the 
equations 


I ci*p(j)r* cL 

j-i 


j ci*p(j)r* (*o,j-p) 

J ■“ 1 

- I Cl+P(J)]'’ • 0 . 

j = l 


(5) 


It is fyrther noted that the Kalman smoother (Bayesian) estimates 

X . = X . (y.I) and associated estimation error covariance P.=P.(/j 
o,j o,j ^ ’ 1 , » , 

are related to x . and P(j) by 

0 » J 


x„ ,• Cl+P(J)] I 

^ 9J 




( 6 ) 


Pj » (P(j)‘M'^)"^ . 


(7) 
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Then the iterative DL loop can be defined; 


< N ft A A 

■ TT "oj ’ 


( 8 ) 


N ^ ^ 


J 

A 


(9) 




(10) 


A A 


A A 




( 11 ) 


with iteration on s . 


2.1 Theoretical Convergence 

Theoretical convergence of the DL algorithm is addressed in detail 
in references [4] and [5]. The first note begins by proving that the DL 
algorithm is a Generalized Expectation Maximization algorithm. It then 
follows that the DL algorithm produces a monotone increasing sequence of 
likelihoods. It also follows, under the additional assumption that there 
exists a pair £* , that maximizes the likelihood, that £* , is 

a fixed point of the DL algorithm. Finally, it is shown that the 
sequence Iglg-Q converges to some y^°, assuming that £ cl^ 
for all s and some c . 

There are two defects in this result. First, it is not guaranteed 
that , £° maximize the log likelihood. There is probably nothing to 
be done about this. Dempster et al [1] remark, as is probably true, that 
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will, in almost all applications, occur at a local, if not 
global, maximum of the likelihood. Second, it would be desirable to 
relax the assumption that 1 c . In [5] it is shown that t < cl 
is automatically true provided all the tests are identically set-up. It 
seems reasonable, but has not yet been proven, that a similar result 
holds in general. 

2.2 Constrained DL Estimates 


In many potential applications of the DL method, the random variable 
of interest (whose mean and covariance we desire to estimate) is of 
rather large dimension. It is also often true that several of the param- 
eters to be estimated are simultaneously poorly estimable and of rela- 
tively little interest. The judicious constraint of some parameters thus 
presents itself as a reasonable possibility. For example, if there is 
reason to believe that some components of the random variable are physi- 
cally uncorrelated, and any correlation is believed to be largely irrele- 
vant, little is likely to be lost if the estimated covariance is con- 
strained to exhibit zero correlation. 

It is desired to obtain those values u and I of the mean and 
covariance of the r.v. under consideration which maximize the log likeli- 
hood 


L = 


log L(m,I),^L = - 7 


N 

I log|I+P(j)| 


" 7 ^ [I+P(j)] ^ 



(12) 


subject to appropriate constraints. Three forms of constraints have been 
explicitly considered. Within the DL iterative context, it seems that a 
rather wide variety of constraints may be handled quite easily. 


7-7 


The general approach taken here is to use Lagrange multipliers to 
reduce the constrained maximization problem to a modified but uncon- 
strained problem. 


The first forms of constraint on u 
consideration here are 


[w^.] and I = under 


♦i = ° * 


’*’jk " ^ * 


(13) 

(14) 


Here and are "selectors", 


6 . = 


'0 if V. unconstrained 
1 

,1 if constrained 


(15) 


ro if <3.. unconstrained 

6. = i 

^ Ll if constrained 


(16) 


* 

Clearly, reasonableness dictates that ^nd = 1 be 

symmetric, and that I 0 . The problem now becomes that of obtaining 
an unconstrained maximum to 


L* = L*(y.I) = L + I X. s.{u..u*) 


^ j!k ^jk^*’jk"*’jk^ 


(17) 


(Again, it is clear that niust be symmetric.) To extremize L , 

we set 
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The explicit computations and solutions become rather tedious and 
are not reproduced here. The complete details and several examples are 
found in [6]. It should be noted that the use of Lagrange multipliers 
has an important advantage that is not mentioned in [6]. It is shown in 

. In words, solving 

Lagrange multipliers gives the sensitivity of the log likelihood to the 
constraint. 

Consider the case where the mean u is partitioned as p = 

into its constrained and unconstrained parts. It is rather straight- 
forward to show (see [6]) that the solution for u at each DL step is 
given by 

( 20 ) 


from which the effect of the constraints on y at each iteration is 

A 

clearly visible. (Here the state estimates x^. are also assumed 
parti oned as v , 




[7] that 


3L 

Jk 


* 

a., =a., 
jk jk 
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Such a formal solution is not generally available for the covariance 
equations. However, several specific cases yield results of some inter- 
est. One such case arises when ^ is partioned as 


and we wish to constrain 
Denoting 


11 

^2 

21 

^22 


h 2 = <2 > "21 ■ = <2 


and 222 “ ^22 * 


r*-M = M = 


Mil Mi2 
M2I M22 


( 21 ) 


where 




( 22 ) 


It is shown in [6] that 


^11 “ ^11 ^ ^2 hi ^12 ^12 ^22 ^21 


^12 ^22 ^22 "22 "21 ’ 


zzl E, 


( 23 ) 



One slightly disconcerting question which does arise though regards 

the positivity of in (23). It is presently not clear what condi- 

tions on ^ 12*^22 conditions on ^re more open 

to question since ^22 is more naturally restricted. Perhaps a relevant 

question is "how does one place a reasonable a priori constraint on a 

cross-covariance matrix?" One reasonable choice for the problem at hand 
* 

might well be ^^2 = 0 • In case , and the difficulty 

regarding definiteness disappears. 


A slightly different sort of result arises from the more specific 
desire to constrain r to be of the form 


E 


E 

E 


11 

^12 

21 

^22 


9 


with “ ^21 “ ® ^22 “ ^*^221^ * 

Then it is shown in [6] that the solution is 


11 " 


(26) 

12 “ 

. 0 . 

(27) 


f "2211 


= 

\ • 

(28) 


L °22iJ ■ « - • 



It is clear that the constrained lack of correlation forces 
and 1^22 exactly follow the data observed for each. It is important 
to note that the above result can be applied to cases where elements of 
1 12 or off-diagonal elements of 1^2 constrained to known non-zero 
values. Suppose the x^ ^ can be expressed as 
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( 29 ) 


^ X,. + x„. 

o,j Ij 2j 


where is uncorrelated with x^^ , the covariance of x^j is known 

and carries the known non-zero values mentioned above, and the covariance 

of Xjj is to be estimated subject to the constraints 1^2 ~ ^21 ~ ^ ’ 

^22 ~ results of equations (26), (27), (28) can 

be applied where x is re-defined to be 

o,j 


x^ . = w + X, . 

o.j Ij 


(30) 


A second form of constraint on u and I is 

* 

u = ay , and 

I = Si* . 


★ yit 


Here u ,1 are assumed to be given, and are undetermined 

constants. For purposes of analysis, these constraints may be more 
conveniently stated as 


♦l = UjM, . u.pj = n , 


'''jk ■ “n“jk ■ '’jk'^ll ' ” 


p>^i>^2. 

(31) 

Kj,k^ , 

(32) 

(j.k)/(l,l) 



The Lagrange multipliers ^-j »^jl^ » then enter in the unconstrained 
maximization of 


L • L t I + I . 


(33) 
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The explicit extremizing solution for o is given by 


a 




( 34 ) 


A similar but quite complicated analytic solution for 3 is obtained. 
As was the case for identity constraints, this value of 3 and 
derivational details are contained in [6], 

A third form of constraint of potential interest, applying only to 
the covariance estimation problem, is that of a specified correlation 
matrix, R = Cp,-,-] » so that the covariance takes the form 

I J 


I = diag (' 





( 35 ) 


with being the individual variance components. Then the constraint 

functions are 


i). . = a. . - /a. . p . . /a . . = 0 . 

ij ij 11 ij JJ 


(i>j) 


( 36 ) 


Clearly the usual properties of a correlation matrix are required. 
The modified likelihood function is then 


* 

L = 


L + 


* L+S 


I X. . . 

iPj 


( 37 ) 


We wish to find I such that • Again, the details of the 

solution are found in [6]. 


7-13 


2.3 Observability 


The effect of unobservable tests on the DL algorithm Is discussed in 
detail in reference [8]. The results can be summarized as follows: 

(1) The state space can be divided into an observable sub- 
space and an unobservable subspace. 

(2) On the observable subspace the DL algorithm performs as 
if the observable subspace is the whole space. Data and 
estimates of £ and u on the unobservable space have 
no effect on the DL algorithm as applied to the observ- 
able subspace. 

(3) The data and estimates on the observable subspace do 
effect the algorithms results on the unobservable sub- 
space and the correlations between unobservable and 
observable subspaces. 

(4) If the DL algorithm is initialized with zero correlation 
between observable and unobservable subspaces then the 
correlation will remain zero and the DL algorithm will 
not change the mean and covariance on the unobservable 
subspace. 

2.4 Estimation of Markov Parameters 


In addition to estimating parameters of initial distributions, it is 
often of interest to use data from multiple tests to estimate dynamic 
parameters of the system, particularly parameters of Markov processes. 
Several possibilities exist for such estimation, and three are discussed 
very briefly here. It should be noted that such estimation likely is 
most useful for consistency checking because of the innately poor identi- 
fiability of such parameters during system tests of short time duration. 
Further, all methods for their estimation are likely to be computation- 
ally costly. 
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In [9], Goodrich and Caines have presented a methodology for maximum 
likelihood idenfication of system parameters based on data from repeated 
independent tests. The likelihood function is based, as in the original 
derivation of [33, on Kalman filter innovations, and the assumption of 
independence of realization yields a rather tractable form. Methods for 
modification of the procedure to allow for correlated tests should be 
studied further. The computational burden can be high for this approach 
since multiple Kalman filter passes are needed at each iteration. 

Sun [10] has presented an application of the E-M procedure to the 
simultaneous estimation of system initial state, process and measurement 
noise levels, and system dynamics based on data from a single test. The 
paper indicates that an extension to repeated tests may be possible. 

Again, further study and extension seem necessary. 

A third possibility combines several aspects of the DL methods as 
previously described, the ideas of Goodrich and Caines, and other work in 
maximum likelihood estimation. 

Consider a dynamical system as described in equation (1), where ® 
and Q may depend on some parameter vector a (e.g. , time constants and 
process noise levels). Whether to solve for simultaneously or 

separately seems unclear as yet. For a given value of « , one might 
obtain via DL the maximum likelihood estimates u(«),^(®) • Then, fixing 
numerical /gradient methods could be used to obtain the value a to 
maximize the likelihood. Also open to question is the variability of 
a — one might assume « to be universally constant, constant over 
groups of tests or unique from test to test. 

An illustration of a possible implementation loop on such a proce- 
dure is found in Figure 1. We desire, again, to estimate »^=E(Xq^^) , 
I=E((Xo Markov parameter vector a . Dropping 

the individual test indicator i for the moment, we have 
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LIKELIHOOD - , 

COMPUTATION AND x ,P I BATCH 


G&C 










X 


0 


P + X 


9 


X the random part of x^ , and 
r 0 


p 

xr 


= I 


The observation z may be decomposed into 


z 


^xo " "r 


» 


a part due to x^ and a random part. In Reference [11] it is shown that 
the innovations may be decomposed as 

*0 + ' 4 - 

where zj^ is computed based on assumed filter and truth models of 
x^=0 . It is then possible to write the log likelihood function, given 
y,I,a, and indexing repeated tests by i, 

-2 log p(zju,I,a) + constant = 


I {log|I+P(i)|+(Xoj-i^)^ (I+P(I))'^ 

- 'og|p(1)|-x^‘, Pi’lXo.fl 

^!k ^i.kt- 


If a is known, this procedure reduces to the DL algorithm. If a 
is unknown then gradient procedures may be used in the maximization. 

This requires differentiating the estimate for « , but not for y,2 . 
Another idea is to solve for o for each test, and only for u,s for 
repeated tests. The proposed iterative loop is illustrated as the dotted 
closure in Figure 1. 
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In summary, the detailed analysis of the effect of Markov parameters 
is a difficult problem which has really only been recently addressed. It 
is felt that substantial additional effort may be required to fully 
develop adequate analysis methodology, but that failure to attempt to 
address the problem in detail may lead to inadequate analysis capabil- 
ities in some areas. 
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3.0 MULTIPLE PHASE THEORY 


Many systems are operated and evaluated in a sequence of phases. 

The analysis of performance in one phase is carried out relatively inde- 
pendent of results from other phases, and then results are combined at 
the end. Although this procedure is not a constraint for many systems, 
there is interest in studying it. This section presents the theory nec- 
essary to combine the results from several phases on one test, then to 
analyze the results using a cumulative methodology such as DL. An 
approximation is presented which allows for reasonably accurate quick 
cumulative evaluations. An extension of the theory is discussed at the 
end. 


Some framework, nomenclature, and assumptions need to be stated 
before the theory is presented. It is assumed that the effects of least 
squares estimates of errors from previous phases have been removed from 
the data prior to its analysis in a Bayesian per phase filter, or equiva 
lently removed after data analysis is complete. The per phase analysis 
is actually done with a Bayes filter, but the theory is developed start- 
ing with a least squares (infinite prior) filter. The least squares 
estimate of errors in the i^h phase can be represented by (see Refer- 
ence [12]) 


X. = V. + X,. + X. - Xj , (40) 

\s ' ^ ' ^i-1 


where 

= Systematic error in this phase, 

X = Random error introduced in this phase, 

-^i 

x^ = Residual estimation error from this phase due to , 

X- * Residual estimation error from previous phase due to Q^_j» R.,-_j 

^i-1 
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The covariance of this estimate is 


E((Xls.p)(*^5-u)T) = J, tP, *P^^ 


where 


E(x.xT) 

E(x, xl ) . 
'i-1 ^‘-1 


The error at the transition time can be represented by (see References 
Cll], [13]) 

Xj = C X. + (42] 


where the following statistics are obtained suppressing the i subscript 


E(x.x.) = P 


E(x^xj) = CPC^ + P' = 

E(i^x^) = PC^ = P^ 
E(Xtx7) = 0 


These statistics can be calculated following data analysis using a 
conventional Bayesian filter. If the initial states are augmented to the 
state vector to provide a fixed-point estimate of errors, all necessary 
covariances and correlations are obtained. For the state vector defini- 


(44) 


X* * 



the covariance of estimation errors obtained from a fixed-point Bayesian 
smoother will be 


P * = 
N 




(45) 


Using the above, the correlation matrix C and the covariance of 
transition-time errors which are independent of initial estimation errors 
can be calculated 


C 


= p-1 

^CN 


“ ^TN " 


(46) 


The max-likelihood information matrix is obtained as 



’’CN 

-1 


0 

^CN 

Ptn 


1 

o 

0 


(47) 


Assuming that it is invertible, the max-likelihood covariance is obtained 
and is given by 




— 


— 

— 


P 

' r 


p 

CP 

P* » 

Pc 

L 

s 

pc’’’ 

CPC^ + p-j. 


(48) 


The discussion thus far has focused upon manipulation of data and 
covariances from one phase of a multiple phase system. The result is the 
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max-likelihood covariance and by similar procedures, the estimate for the 
combined state vector at the initial and transition times. Since the 
error in the max- 1 ikeli hood estimate, as represented by Equation (40) is 
unbiased, the error in the estimate of the i^h phase, although corre- 
lated with the previous phase, is uncorrelated with all phases previous 
to that. Hence, the multiphase max-likelihood covariance of the max- 

likelihood estimate for the stacked vector of u.+x . vectors is of the 

1 n 

banded form (see Reference [12]). 


P 



-P 




-P 




0 

0 


-p. 


-p 


0 

0 




-P. 


-P. 




(49) 


This banded form has some interesting properties that lead to a use- 
ful result, especially when the following often practical assumptions are 
made: 

(1) The derivations presented have already assumed that all states 
in each phase are observable — so that the max-likelihood 
information matrix is invertible 

(2) Interphase correlations can be ignored for preliminary cumula- 
tive analysis giving an algorithm suboptimal in the sense that 
information is thrown away but not in the sense that an approx- 
imation is made. 


Although the first assumption may not always be true, it may be 
possible to redefine the state vector so that the unobservable states do 
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not enter the system until the phase in which they are observable. This 
can be accomplished automatically in a mathematical sense utilizing the 
Singular Value Decomposition (SVD) algorithm. Thus, although the first 
assumption may not be able to be satisfied explicitly, there are ways to 
accomplish its effect without degrading the fidelity of the model. 
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4.0 USE OF SUROPTIMAL STATE ESTIMATES IN MULTIPHASE ANALYSIS 


It is often the case that data from the several phases under consid- 
eration are obtained from Kalman-Schmidt filter/fixed point smoother 
algorithms. Thus the algorithm for processing a test phase should be 
capable of handling suboptimal gains. Even if the filter were optimal, 
the equations for the suboptimal case would be applicable, and, in some 
situations, might be preferable to equations assuming optimality. 

It is also desirable that the processing for each phase be done 
independently. In some cases, the processing for different phases may be 
done by different organizations. Thus, the per phase data reduction must 
use no information from other phases. The combination of phase estimates 
is done as the final step in the data reduction. 

Reference [14] defines the equations required for the phase data 
reduction. These equations are fairly general and would apply to most 
suboptimal filters. Also presented there are the additional recursive 
equations which must be computed in a consider filter so that the phases 
may be combined. These equations only apply to a Kalman-Schmidt filter 
(which automatically computes the correct covariance matrix) but could be 
modified for other suboptimal filters. The following section presents 
the algorithm for combining the suboptimal (or optimal) estimates from 
different phases. 

4.1 Multiphase Reduction Using Suboptimal Estimates 

The output of each phase will be a suboptimal, smoothed estimate of 
the state at initial time and transition time. Also obtained are the 
various covariance and sensitivity matrices. The true state at the epoch 
of each phase is assumed to have a mean and random component; i.e., for 
phase i 



( 50 ) 
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The DL method attempts to estimate and cov (x^) by combining 
results of different phases and tests. To do this, the state estimates 
for each phase are manipulated so that they are In the familiar form 

z = HXq + V . (51 ) 

Consider the combination of phase 1 and 2 shown in Figure 2. 

The value of x| is 


Phase 1 Phase 2 



1 

^T 


t^ t^ ^ 

^0 ^T ^m 

0 T 't 



Figure 2 Phase Combination 


directly included in the phase 2 initial condition, i.e. , 




(52) 


where T is a transformation matrix (not to be confused with transition 
time tj ) . 

Now consider the estimates obtained from phase 1: 


+ X^ + Xq 


Xy » + D 2 t'^j + v| 


(53) 


(54) 
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and from phase 2 


»o • 4 * * *0 


Dio(u^*xJ+Txj) + . 


(55) 


These equations can be combined in matrix notation as: 


^0 


~°]o 


0 


V 


■'J' 


-1 

^T 

= 

d]t<*'t 

»2T 

0 


“T 

+ 

v| 

(56) 

X > 
ro ro 

L _ 


_°10^*<’T 


°10_ 




1 

< 

o ro 



111 2 2 2 1 1 
where y = p + Xj and y = p + Xj . Notice that Vq and Vy are 

correlated but are uncorrelated with v« . Also notice that there is 

1 2 ^ 

no a priori information on y and y but that the a priori variance 

of Wt is Qt . Thus, equation (56) can be treated as three measure- 
II ^ ^2 

ments in a Bayesian least squares estimator for y‘ , Wj and y 


where 





is calculated in [14]. 
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1 2 

If all states are observable, then y and y should be maximum likeli- 
hood estimates. 

In order to better understand the result of this processing, we 
assume that the first two measurements in equation (56) were processed 
first (to estimate and Wj ) the third measurement was processed 
separately to estimate 

+ Tx^ . (57) 

This can be done because the measurement errors are uncorrelated. Then 

2 

we want to combine the estimates. Since y had infinite a priori 

variance, all information in the third measurement will be used to estimate 
2 1 

y if y and w^. were observable from the first two measurements, 

' 1 

i.e., the estimates of y and Wj will not change. Thus, 

and the covariance of the error in the estimate of 

"^Tl 

P 2 + TU,: I] P , -I- T^ . (59) 

y^ ' y 

The above analysis is similar to that given previously in the sense 
that data processing and requirements analysis can be done phase by phase. 


7-27 


5.0 BIBLIOGRAPHY 


[1] Dempster, A. P, , N. M. Land and D. B. Rubin, "Maximum Likelihood 
from Incomplete Data via the EM Algorithm”, J, Roy. Stat. Soc. , 
Series B, Vol. 39, No. 1, 1977. 

[2] Levy, L. J. , R. H. Shumway, D. E. Olsen and F. C. Deal, Jr., "Model 
Validation From an Ensemble of Kalman Filter Tests", Proc. 21st 
Midwestern Symposium on Circuits and Systems, 1978. 

[3] Shumway, R. H., D. E. Olsen and L. J. Levy, "Estimation and Tests 
of Hypothesis for the Initial Mean and Covariance in the Kalman 
Filter", American Statistical Association, San Diego, August, 1978. 

[4] Levine, W, L., "Convergence of the DL Algorithm", BTS, May, 1980. 

A 

[5] Levine, W, L., "Boundedness of in the DL Algorithm", BTS, 

July, 1980. 

[6] Haley, D. R. , "Modifications and Extensions to the DL Algorithm to 
Allow for Constrained Max Likelihood Estimation", BTS, May, 1980. 

[7] Varaiya, P. 0., Notes on Optimization , Von Nostrand Reinhold, New 
York, 1972. 

[8] Levine, W. L., "Behavior of the DL Algorithm for Unobservable 
Tests", BTS, July, 1980. 

[9] Goodrich, R. L., and P. E. Caines, "Linear System Identification 
from Nonstationary Cross-Sectional Data", IEEE Trans. AC. Vol. 
AC-24, No. 3, June, 1979. 

[10] Sun, F. K., "An Alternative Approach for Maximum Likelihood 
Identification", 18th Conf. Dec. Cont., Ft. Lauderdale, Florida, 
December 1979. 


7-28 



[11] Porter, D. W. , "Relationships Required to Implement Generalized 
Likelihood Approach", BTS, October, 1978. 

[12] Porter, D. W., To Appear, 

[13] Porter, D. W. , and T. S. Englar, Jr., "Multi -Object Tracking via a 
Recursive Generalized Likelihood Approach", 18th Conf. Dec. Cont., 
Ft. Lauderdale, Florida, December, 1979. 

[14] Gibbs, B. P., "Suboptimal Per-Phase Data Reduction", BTS, May, 
1980. 


7-29 


